Azure HDInsight
Azure HDInsight is Microsoft's distribution of Hadoop. The Azure HDInsight ecosystem includes the following features/components: Pig, Hive, Hbase, Sqoop, Oozie, Ambari, Microsoft Avro Library, YARN, Cluster Dashboard and Tez.
Apart from the above listed features/components, there are a few other components which enable reporting and analytics on top of data present in Azure HDInsight. These components include the following:
More information: http://azure.microsoft.com/en-us/documentation/articles/hdinsight-introduction
Here are few highlights of Azure HDInsight:
- Azure HDInsight is based on Hortonworks Data Platform.
- Azure HDInsight enables Apache Hadoop as a service in Microsoft Azure cloud thereby leveraging all the benefits of cloud computing.
- Azure HDInsight offers strong support for PowerShell via HDInsight PowerShell Cmdlets.
- Windows Azure and HDInsight PowerShell Cmdlets can be used to perform various activities including uploading, downloading, movement of data to and from Azure Blob Storage and On-Premises file systems, configuring/executing/post-processing jobs on HDInsight, and other related activities.
- Azure HDInsight being a Hadoop service in the cloud, one can provision a cluster, process the data, and destroy the cluster and pay for only the resources used.
- Microsoft also offers an HDInsight Emulator which allows developers to explore HDInsight on-premises without requiring an Azure Account.
Links & Additional Information
Getting Started